Information extraction for enhanced access to disease outbreak reports
نویسندگان
چکیده
Document search is generally based on individual terms in the document. However, for collections within limited domains it is possible to provide more powerful access tools. This paper describes a system designed for collections of reports of infectious disease outbreaks. The system, Proteus-BIO, automatically creates a table of outbreaks, with each table entry linked to the document describing that outbreak; this makes it possible to use database operations such as selection and sorting to find relevant documents. Proteus-BIO consists of a Web crawler which gathers relevant documents; an information extraction engine which converts the individual outbreak events to a tabular database; and a database browser which provides access to the events and, through them, to the documents. The information extraction engine uses sets of patterns and word classes to extract the information about each event. Preparing these patterns and word classes has been a time-consuming manual operation in the past, but automated discovery tools now make this task significantly easier. A small study comparing the effectiveness of the tabular index with conventional Web search tools demonstrated that users can find substantially more documents in a given time period with Proteus-BIO.
منابع مشابه
The MiTAP System for Monitoring Reports of Disease Outbreak
The MiTAP system [Damianos et al. 2002a, 2002b, 2003a, 2003b; MiTAP 2001] was developed as an experimental prototype using human language technologies for monitoring infectious disease outbreaks and other global disasters. MiTAP is designed to provide timely multi-lingual information access to analysts, medical experts, health services, and individuals involved in humanitarian assistance and re...
متن کاملUsing Hedges to Enhance a Disease Outbreak Report Text Mining System
Identifying serious infectious disease outbreaks in their early stages is an important task, both for national governments and international organizations like the World Health Organization. Text mining and information extraction systems can provide an important, low cost and timely early warning system in these circumstances by identifying the first signs of an outbreak automatically from onli...
متن کاملA Business Model to Detect Disease Outbreaks
Introduction: Every year several disease outbreaks, such as influenza-like illnesses (ILI) and other contagious illnesses, impose various costs to public and non-government agencies. Most of these expenses are due to not being ready to handle such disease outbreaks. An appropriate preparation will reduce the expenses. A system that is able to recognize these outbreaks can earn ...
متن کاملTraditional Medicine: The Need to Revive and Return to the Past amid the Outbreak of Coronavirus
Importance of this therapeutic method in Iran is more prominent than other countries in the world for two main reasons; the first one is plant richness, biodiversity, having 11 climates of 13 world-known climates, and diversity of 8000 plant species that are considered as an exclusive capacity in Iran. The second is considering the possibility of inadequate access to medicine at the internation...
متن کاملPsychological Factors Affecting on the Culture and Awareness of Cyber Security in During of Covid-19 Outbreak
The aim of this study was to investigate the psychological factors affecting the culture and awareness of cyber security in the period of Covid-19 outbreak by qualitative method and theme analysis. Research data from upstream documents that include all valid articles published in 2020 to 2022 inside and outside the country, with 4 keywords (culture, awareness, cyber security, psychological fact...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of biomedical informatics
دوره 35 4 شماره
صفحات -
تاریخ انتشار 2002